植物 分 类 与 资源 学 报 2013, 35 (6): 751 ~760 


Plant Diversity and Resources 





裸子 植物 DNA 条 形 码 ITS2 高 通用 性 引物 的 筛选 * 


41,2 u 3 4e pee 
ROG. X T°, BH 
(1 中 国 科 学 院 昆 明 植物 研究 所 生物 多 样 性 与 生物 地 理学 重点 实验 室 ， 云 南 昆明 650201; 2 云南 省 农业 科学 院 
高 山 经 济 植 物 研 究 所 ， 云南 丽江 674100; 3 景德镇 学 院 ， 江 西 景德镇 333000) 



































摘要 : DNA 条 形 码 技术 是 利用 标准 DNA 片段 进行 准确 快速 鉴定 物种 的 一 种 方法 ， 理 想 的 DNA 条 形 码 片 段 
应 具有 高 通用 性 。 虽 然 核糖 体 DNA. 内 部 转录 间隔 区 IT (ITS2) 被 建议 作为 种 子 植物 有 效 的 DNA 条 形 码 ， 
但 目前 裸子 植物 还 没有 通用 性 高 的 引物 可 用 。 为 获得 高 通用 性 的 ITS2 引物 ， 本 研究 基于 裸子 植物 55 个 属 
的 5.8S 基因 的 保守 序列 区 设计 了 3 个 正 向 引物 ,与 已 有 的 ITS 反 向 引物 组 合 ， 组 成 了 7 对 ITS2 引物 进行 
通用 性 的 评价 。 选 取 了 裸子 植物 8 目 、12 科 和 40 属 的 56 个 种 用 于 本 文 的 研究 。 引 物 组 合 5. 8SRZITS4 、 
5. 8SRa/ITSA 和 5. 8SF2/S3R 因为 在 科 水 平 评价 中 通用 性 低 或 者 产生 的 PCR 产物 有 双 带 ， 因 而 排除 在 全 部 
物种 水 平 上 进一步 评价 。 其 余 4 对 引物 (GYM, 5. 8SF1/ITS4、GYM_5. 8SF1/S3R、GYM_5. 8SF2/ITSA 和 
S2F/S3R) 在 56 个 物种 的 PCR 检测 中 , 3545 100% 的 扩 增 率 。 基 于 PCR 产物 的 亮度 、 序 列 质量 和 正 反 向 
引物 覆盖 率 的 综合 评价 ， 建 议 引 物 GYM_S. 8SF2/ITS4 作为 裸子 植物 条 形 码 片 段 ITS2 最 好 的 通用 引物 。 
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Abstract; DNA barcoding is an approach for rapid and accurate species identification using a standard DNA region. 
Ideal barcodes should have high level of universality in PCR and sequencing. Although the nuclear ribosomal DNA 
(nrDNA) internal transcribed spacers 2 ( ITS2) was proposed as potential DNA barcode for seed plants, including 
gymnosperms, no suitable ITS2 primer pair for gymnosperms available shows high universality in either aspect. To 
obtain a highly universal ITS2 primer pair, we newly designed three forward primers for ITS2 based on sequences in 
the 5. 8S gene greatly conserved among 55 genera of gymnosperms. Combined with previously published reverse 
primers of ITS, seven candidate primer combinations for ITS2 were assessed for their universality. A total of 56 spe- 
cies, representing 40 genera from all the 12 families of the eight orders of gymnosperms , were sampled in this study. 
Three ITS2 primer combinations, 5. 8SR/ITS4, 5. 8SRa/ITS4, and 5. 8SF2/53R, were excluded due to their low 
universality or because of yielding double bands in the PCR in the family-level assessment. The remaining four prim- 
er combinations, GYM 5. 8SFI/ITS4, GYM_5. 8SF1/S3R, GYM 5.8SF2/ITS4, and S2F/S3R, showed high PCR 


universality ( 10096 ) on all 56 sampled species. Based on our overall assessment of PCR and sequencing universali- 
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ty, brightness of PCR bands, and sequence quality and coverage, we propose the primer combination GYM 5. 8SF2/ 


ITS4 as ITS2 barcode for gymnosperms because it was shown to be the most “universal” . 


Key words: DNA barcoding; Gymnosperms; ITS2; Universal primer; Species identification 


DNA barcoding is an approach to identify spe- 
cies and/or discover new species using one or a few 
standardized DNA regions from different genomes 
( Hebert et al., 2003; Hollingsworth, 2011; Gao et 
al., 2012) , which has been widely used in the field 
of taxonomy, ecological surveys and assessment of 
biodiversity ( Hebert et al. , 2004; Liu et al. , 2011; 
Lahaye et al., 2008; Valentini et al., 2009; 
Garcia-Robledo et al., 2013). The ideal DNA bar- 
codes should satisfy the criterion of an appropriately 
short sequence length (300—800 base pairs, bp) to 
facilitate PCR amplification with a single primer pair 
(universality ), be amenable to PCR and bidirec- 
tional Sanger sequencing with little requirement for 
manual editing of the sequence trace files ( sequence 
quality and coverage), and provide maximal dis- 
crimination among the species ( Kress et al., 2005; 
CBOL Plant Working Group, 2009). The mitochon- 
drial gene CYTOCHROME OXIDASE 1 ( COI) has 


€ è 
animal 


been proven to be a remarkably effective 
barcode” for discriminating species of birds, fishes, 
and insects ( Hebert et al., 2004; Holmes et al., 
2009; Linares et al. , 2009; Hollingsworth, 2011). 
However, it is more difficult to find a similarly ro- 
bust and effective barcode for plants ( CBOL Plant 
Working Group, 2009; Hollingsworth et al., 2011; 
Li et al., 2011a). Although many studies have com- 
pared the performance of a range of candidate DNA 
regions as barcodes (e. g. Sass et al., 2007; Fazek- 
as et al., 2008; Hollingsworth et al., 2009; Seberg 
and Petersen, 2009; Li et al., 2011b) , no single can- 
didate DNA locus evaluated fitted all criteria for seed 
plants. Thus, combinations of different DNA loci 
were suggested as barcodes for plants (Chase et al. , 
2007; Kress and Erickson, 2007; Hollingsworth et 
al., 2009; CBOL Plant Working Group, 2009). 
Kress et al. (2005) initially recommended the 
cpDNA trnH-psbA intergenic spacer and the nuclear 


ribosomal DNA ( nrDNA) internal transcribed spacer 
(ITS) region as barcodes for flowering plants. In 
2009, the Consortium for the Barcode of Life 
(CBOL) Plant Working Group proposed the combi- 
nation of the coding regions of rbcL 4 matK as the 
core barcode for land plants. Thereafter, the combi- 
nation of irnH-psbA and ITS2 ( internal transcribed 
spacer 2) was demonstrated to represent an effective 
barcode for species identification of medicinal plants 
in several studies (e. g. Chen et al., 2010; Pang et 
al. , 2010; Gao et al. , 2010). ITS2 was proposed to 
be used as a universal DNA barcode complementing 
COI to identify animal species (Yao et al., 2010) , 
and as complementary barcode to the core barcode of 
rbcL+matK for plants ( Hollingsworth et al. , 2011). 
Subsequently, based on the assessment of a large 
scale dataset, the China Plant BOL Group (Li et 
al., 2011b) backed the incorporation of ITS or ITS2 
into the core barcode for seed plants. 

Though many researchers have proposed the use 
of nrDNA ITS as a standard barcode for plants, there 
is a residual nervousness stemming from three major 
potential problems; i.e. fungal contamination, pres- 
ence of paralogous copies, and recovery (simply dif- 
ficult to amplify and sequence) ( Hollingsworth, 
2011). Of these, the main limitation is the some- 
times great difficulty to amplify and sequence the ITS 
region (Kress et al., 2005; Li et al., 2011b), per- 
haps due to non-conserved primer sites ( Möller, 
2000). For instance, the recovery rate of 76.5% was 
some 10% — 1596 lower than for the other barcode 
markers based on a 6286 sample dataset (Li et al., 
2011b). A high level of primer universality for poly- 
merase chain reaction ( PCR) and sequencing is one of 
the most important criteria for DNA barcoding ( Chase 
et al., 2007; Kress and Erickson, 2007; Ford et 
al., 2009; Hollingsworth et al., 2009). Thus, it is 


very important to select highly efficient primer pairs 


6 期 LI and GAO; Highly Universal DNA Barcoding Primers of ITS2 for Gymnosperms 753 





for the PCR amplification of ITS or ITS2. 
Gymnosperms are a class of seed-bearing plants, 
which are the dominant vegetation in many colder 
and arctic regions and have major economic uses as 
important wood resources and as ornamental plants. 
Cycadi- 


dae, Ginkgoidae, Gnetidae, and Pinidae, represen- 


The class consists of four subclasses, i. e. 


ting eight orders, 12 families, 83 genera, and ap- 
proximately 990 species ( Christenhusz et al., 2011). 
There are about 250 gymnosperm species in 34 gene- 
ra and 10 families in China (Fu et al., 1999). The 
great variation in length of the ITS region ( including 
ITS1, ITS2 and 5. 8S) in gymnosperms is possibly a 
limit for the use of the region as DNA barcode. 
However, length variation was predominantly ob- 
served for ITS1 (e.g. ranging from 630 to 3 125 bp 
in length in Pinus, Liston et al., 1999) , while the 
ITS2 and 5. 8S gene are more length conserved ( Lis- 
ton et al., 1996, 1999; Gernandt and Liston, 1999). 
Therefore, ITS2 seems a more likely candidate for a 
gymnosperm DNA barcode, when obtaining full ITS 
sequences is problematic. Another detrimental aspect 
of is the sometimes low universality of ITS when using 
universal gymnosperm PCR primers (Li et al., 
2011b), also reported for gymnosperm universal 
primers for ITS2 (Chiou et al., 2007; Chen et al., 
2010). Thus, a highly PCR universal primer pair 
for ITS2 is required if this region is even to be con- 
sidered suitable for gymnosperms. 

In the present study, we therefor newly designed 
and tested three forward primers, GYM | 5. 8SF1, 
GYM, 5.8SF2, and 5. 8SRa, based on sequences in 
the 5. 85 gene greatly conserved over 55 gymnosperm 
genera. We selected 56 species representing 42 gene- 
ra of all 12 gymnosperm families to test the new prim- 
ers. A total of seven candidate primer combinations 
for ITS2 were evaluated to determine their universality 
in PCR amplification across the 56 gymnosperm spe- 
cies and Sanger sequencing of seven species from the 
four subclasses of gymnosperms here. The ultimate 
aim was to be able to propose the most universal 


primer pairs of ITS2 for barcoding gymnosperms. 


Materials and methods 
Sample strategy 

To represent the large diversity across gymno- 
sperms, a total of 56 species representing 42 genera 
from 12 families of eight orders in gymnosperms were 
sampled in this study (Table 1) , covering around 
49% of genera, and all families, orders, and sub- 
classes of gymnosperms. All gymnosperm species oc- 
curring in China were covered. Voucher specimens 
were deposited at the herbarium of Kunming Institute 
of Botany, Chinese Academy of Sciences (KUN). 
For some larger genera, more than one species per 
genus were selected. 
DNA extraction 

Total DNA was extracted from silica-gel dried 
leaf tissues using the CTAB method ( Doyle and 
Doyle, 1987). The concentration of total genomic 
DNA was check using spectrophotometer ( ND-1000 , 
NanoDrop, Wilmington, USA), and then was dis- 
solved in TE buffer (10 mmol - L’ Tris-HCl, pH 8.0, 
1 mmol - L'' EDTA) to a final concentration of 40— 
50 ng * RL to avoid any variation in PCR success 
due to DNA concentration differences. 
Primer design and primer combination screening 
strategy 

Sequences of the 5. 8S gene and ITS2 for a total 
of 74 species, representing 55 genera from all the 12 
families of gymnosperms, were obtained from Gen- 
Bank. These sequences were aligned in Clustal X 
(Larkin et al., 2007). Three new forward primers 
(GYM, 5. 8SF1, GYM, 5. 8SF2, and 5. 8SRa, see 
Table 2) were designed in regions of the 5. 8S gene 
largely conserved across the gymnosperms included 
here with the help of Oligo 6.0 ( Offerman and Rych- 
lik, 2003). A further forward primer, 5. 8SR ( de- 
signed by Vilgalys lab; http://biology. duke. edu/ 
fungi/mycolab/primers. htm) was selected for the 
universality assessment in our study as well. In com- 
bination with the reverse primers ITS4 and S3R, we 
had six ITS2 primer combinations to test. The primer 
pair, S2F/S3R for ITS2 (Chiou et al., 2007) , pro- 


posed for plant barcoding in several studies previous- 
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ly (Chen et al., 2010; Gao et al., 2010; Pang et comparison to primer pairs including our newly de- 
al., 2010, 2012), was used as a bench mark in signed primers (Table 2). 


Table 1 Voucher information and sample code of the species included in this study, and polymerase 


chain reaction success for four ITS2 primer combinations 











Primer 

Subclass Family Taxon Voucher Code GYM_ GYM_ GYM_ say 

5.8SF1  5.8SF1 5. 8SF2 S3R 

ITS4 S3R ITS4 

Cycadidae Cycadaceae Cycas guizhouensis * Glm-06032 G1 +++ +++ +++ +++ 
Cycadidae Cycadaceae Cycas micholitzii Ly-012 G2 十 十 十 十 十 十 TL TL 
Cycadidae Zamiaceae Encephalartos lehmannii * Ly-014 G3 十 十 十 十 十 十 十 十 十 十 十 十 
Ginkgoidae Ginkgoaceae Ginkgo biloba* Ly-028 G4 + + +++ ++ 
Gnetidae Ephederaceae Ephedra intermedia 12384 G57 +++ +++ +++ +++ 
Gnetidae Ephederaceae Ephedra likiangensis * Ly-024 G55 十 十 十 十 十 十 十 十 十 十 十 十 
Gnetidae Gnetaceae Gnetum pendulum * GBOWS0801 G54 十 十 十 十 十 十 十 十 十 十 十 十 
Gnetidae Welwitschiaceae Welwitschia mirabilis * RBGE19941311 G59 +++ +++ +++ +++ 
Pinidae Araucariaceae Agathis dammara Gy058 G58 +++ 十 十 十 十 十 十 Tra 
Pinidae Araucariaceae Araucaria bidwillii Ly-015 G46 +++ +++ +++ +++ 
Pinidae Araucariaceae Araucaria cunninghamii * Ly-016 G45 +++ +++ +++ +++ 
Pinidae Araucariaceae Araucaria heterophylla Ly-004 G47 +++ +++ +++ +++ 
Pinidae Cupressaceae Calocedrus macrolepis var. formosana Glm-103074 G35 +++ +++ +++ +++ 
Pinidae Cupressaceae Chamaecyparis formosensis Glm-103065 G33 +++ 十 十 十 十 十 十 十 十 十 
Pinidae Cupressaceae Cunninghamia lanceolata Ly-007 G22 ++ ++ ++ ++ 
Pinidae Cupressaceae Cupressus duclouxiana Ly-011 G32 +++ +++ +++ +++ 
Pinidae Cupressaceae Cryptomeria japonica Glm-103142 G25 +++ +++ +++ +++ 
Pinidae Cupressaceae Fokienia hodginsii Ly-006 G31 +++ +++ +++ +++ 
Pinidae Cupressaceae Glyptostrobus pensilis * Ly-005 G26 +++ +++ +++ +++ 
Pinidae Cupressaceae Juniperus pingii var. wilsonii Ly-026 G38 +++ +++ +++ +++ 
Pinidae Cupressaceae Juniperus squamata * Glm-082126 G30 +++ +++ +++ +++ 
Pinidae Cupressaceae Metasequoia glyptostroboides Ly-002 G29 +++ +++ +++ +++ 
Pinidae Cupressaceae Platycladus orientalis Ly-020 G37 +++ +++ +++ +++ 
Pinidae Cupressaceae Sequoia sempervirens Ly-021 G28 +++ +++ +++ +++ 
Pinidae Cupressaceae Taiwania flousiana Glm-092411 G24 +++ +++ +++ +++ 
Pinidae Cupressaceae Taxodium distichum Ly-009 G27 +++ +++ +++ +++ 
Pinidae Cupressaceae Thujopsis dolabrata Ly-019 G34 +++ +++ +++ +++ 
Pinidae Cupressaceae Thuja sutchuenensis Ly-018 G36 +++ +++ +++ +++ 
Pinidae Pinaceae Abies georgei Glm-102904 G12 +++ +++ +++ +++ 
Pinidae Pinaceae Cathaya argyrophylla * Ly-017 G13 +++ +++ TL TL 
Pinidae Pinaceae Cedrus deodara Ly-029 G8 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Pinaceae Keteleeria davidiana Ly-001 G14 +++ +++ +++ +++ 
Pinidae Pinaceae Keteleeria evelyniana Ly-003 G15 +++ +++ +++ +++ 
Pinidae Pinaceae Keteleeria fortunei Ly-010 G16 +++ +++ +++ +++ 
Pinidae Pinaceae Larix himalaica Glm-081604 G10 +++ +++ +++ +++ 
Pinidae Pinaceae Larix potaninii var. chinensis Glm-06081 G9 +++ +++ +++ +++ 
Pinidae Pinaceae Picea likiangensis Glm-08912 G19 +++ +++ +++ +++ 
Pinidae Pinaceae Picea smithiana Glm-081533 G18 +++ +++ +++ +++ 
Pinidae Pinaceae Pinus armandi Ly-027 G7 +++ 十 十 十 十 十 十 十 十 十 
Pinidae Pinaceae Pinus wallichiana Glm-081471 G5 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Pinaceae Pinus yunnanensis * Ly-025 G6 +++ +++ 十 十 十 十 十 十 


Pinidae Pinaceae Pseudolarix amabilis Ly-008 Gll 十 十 十 十 十 十 十 十 十 十 十 十 
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Table 1 continued 
Primer 

Subclass Family Taxon Voucher Code GYM_ GYM_ GYM_ SF 

5.8SF1  5.8SF1 5.8SF2 SR 

ITS4 S3R ITS4 

Pinidae Pinaceae Pseudotsuga sinensis Glm-103063 G17 Tt 十 十 十 十 十 十 十 十 十 
Pinidae Pinaceae Tsuga dumosa Glm-081797 G21 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Podocarpaceae Dacrycarpus imbricatus Ly-023 G39 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Podocarpaceae Nageia fleuryi * Ly-022 G40 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Podocarpaceae Nageia nagi Ly-013 G41 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Podocarpaceae Podocarpus macrophyllum GBOWS0294 G42 +++ +++ +++ +++ 
Pinidae Podocarpaceae Podocarpus neriifolius Glm06211 G43 +++ +++ +++ +++ 
Pinidae Sciadopityaceae Sciadopitys verticillata * Glm-092241 G44 +++ +++ TL +44 
Pinidae Taxaceae Amentotaxus argotaenia 182 G48 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Taxaceae Cephalotaxus mannii ZSD001 G53 +++ +++ +++ +++ 
Pinidae Taxaceae Cephalotaxus wilsoniana Glm-103119 G52 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Taxaceae Pseudotaxus chienii * Glm-07549 G51 十 十 十 十 十 十 十 十 十 十 十 十 
Pinidae Taxaceae Taxus wallichiana RC1289 G50 +++ +++ +++ +++ 
Pinidae Taxaceae Torreya fargesii var. yunnanensis Glm-092567 G49 +++ +++ +++ +++ 


Species in bold were selected for sequencing to evaluate sequence quality and coverage in this study. " Species were selected for evaluating PCR 


success using the seven candidate ITS2 primer combinations at family-level. +++, strong band; ++, medium band; +, weak band 


Table 2 The seven combinations of ITS2 primers tested for universality in this study 








Length i anneali 

Primer name Direction Sequence (5'-3') ane PORUM qne Source 
/bp temperature/°C 

GYM_5.8SF1 F TTGYAGAATCCCGTGARTC 19 A8 Designed in this study 
ITS4 R TCCTCCGCTTATTGATATGC 20 White et al., 1990 
GYM. 5.8SF1 F TTGYAGAATCCCGTGARTC 19 50 Designed in this study 
S3R R GACGCTTCTCCAGACTACAAT 21 Chiou et al., 2007 
GYM_5.8SF2 F GYAGAATCCCGTGARTCATC 20 52 Designed in this study 
ITS4 R TCCTCCGCTTATTGATATGC 20 White et al., 1990 
GYM_S. 8SF2 F GYAGAATCCCGTGARTCATC 20 52 Designed in this study 
S3R R GACGCTTCTCCAGACTACAAT 21 Chiou et al., 2007 
5.8SR F TCGATGAAGAACGCAGCG 18 52 Vilgalys lab 
ITS4 R TCCTCCGCTTATTGATATGC 20 White et al., 1990 
5. 85Ra F WCGATGAAGAACGYAGCG 18 52 Designed in this study 
ITS4 R TCCTCCGCTTATTGATATGC 20 White et al., 1990 
S2F F ATGCGATACTTGGTGTGAAT 20 52 Chiou et al., 2007 
S3R R GACGCTTCTCCAGACTACAAT 21 Chiou et al., 2007 


F, forward; R, reverse. R-A/G, W-A/T, Y-C/T; bp-base combinations 


To determine the optimum annealing tempera- 
ture, gradient PCR (annealing temperature ranging 
from 46 ^C to 58 C with a 2 C gradient) was per- 
formed using the seven primer combinations on four 
species, Cycas guizhouensis, Ginkgo biloba, Pinus 
yunnanensis, and Gnetum pendulum, representing 
the four gymnosperm subclasses ( Table 1). The 


gradient PCR reactions were carried out twice to con- 


firm the optimum annealing temperatures. We then 
adopted a two-step approach to determine the univer- 
sality of the six primer combinations (Li et al., 
2011). First, we selected 14 species representing 
all the 12 families for PCR amplification with all the 
candidate ITS2 primer combinations. For the large 
families Pinaceae and Cupressaceae, two species 


were used in the family-level assessment (Table 1). 
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Second, the candidate primer combinations per- 
formed well with a single PCR band and high PCR 
success rate were selected for further assessment on 
all the remaining sampled species. 
PCR amplification and sequencing 

All PCR reactions were carried out with a 2720 
Thermal Cycler ( Applied Biosystems, Foster City, 
CA, USA). The final reaction volume of 20 uL con- 
tained 0.5 uL each of the forward and reverse prim- 
ers (5 pmol - L), 10 pL 2xTaq PCR Master Mix 
(0. 1 U Taq polymerase * iL! , 0.5 mmol +L” each 
dNTP, 20 mmol - L'' Tris-HCl (pH 8.3) , 100 mmol 
-L' KCl, 3 mmol - L' MgCl,; Tiangen Biotech, 
Beijing, China) , 0.2 uL bovine serum albumin (10 
wg? pL) and 1 pL template DNA. Negative con- 
trols were run alongside all PCR reactions. The ther- 
mocycling profile for ITS2 was as follows: 94 *C for 
3 min, then 35 cycles of 94 *C for 1 min, 46—58 *C 
(in the gradient PCR) or the primer pair specific 
optimum annealing temperature ( Table 2) for 45 s, 
72 *C for 1 min, and a final extension step of 72 °C 
for 7 min. The PCR products were analyzed by elec- 
trophoresis on 1.2% Tris-acetate-EDTA agarose gels 
stained with ethidium bromide alongside a GeneRul- 
er 100 bp DNA ladder ( Fermentas, Glen Burnie, 
MD, USA). 

The PCR products were purified using ExoSAP- 
IT (GE Healthcare, Cleveland, OH, USA). Se- 
quencing reactions were carried out on forward and 
reverse strands in a total volume of 6 wL containing 
0.2 uL of the purified PCR product, 0. 15 uL of 
BigDye terminator sequencing mixture (V3.1), 1.2 
uL of sequence buffer, and 1. 4 uM of primer). 
The Sanger sequencing profile was 32 cycles of 96 °C 
for 10s, 50 C for 5 s, and 60 C for 4 min. The pu- 
rified sequencing products were run on an ABI 
3730xl automated sequencer ( Applied Biosystems, 
Foster City, USA). 
Analysis of PCR and sequencing success, sequence 
quality and coverage 

Recording of the PCR and sequencing success 


followed the methods of Li et al. (2011b). The se- 


quence quality was assessed using the software Se- 
quencing Analysis 5. 3. 1 ( Applied Biosystems, Fos- 
ter City, USA) and followed Li et al. (2011). The 
percentage of bases with >QV30 (QV, an estab- 
lished metric for determining quality sequencing da- 
ta) and the coverage of the bidirectional sequences 
was determined using Sequencher 5. 0 ( Gene Codes 
Corporation, Ann Arbor, MI, USA). 


Results 
Optimum annealing temperature of the seven 
candidate primer combinations 

The optimum annealing temperature for the can- 
didate primer combination was determined based on 
the success rate of the PCR reactions and the bright- 
ness of the PCR bands ( assumed to reflect amplifi- 
cation efficiency ). In this study, the optimum an- 
nealing temperature for the primer combinations 
GYM 5.8SF1/ITS4 and GYM, 5. 8SFI/S3R was 48 °C 
and 50 °C, respectively. The remaining five primer 
combinations showed the same optimum annealing 
temperature of 52 ^C. (Table 1). 
Universality of the candidate primer combinations 

In the universality assessment at the family-lev- 
el on 14 samples, primer pair 5. 85R/ITS4 yielded 
strong PCR bands for seven samples (5096) , weak 
PCR bands for five samples (35. 796) , and no bands 
for two species ( Ginkgo biloba and Sciadopitys verti- 
cillata). Though the primer combinations of 5. SSRa/ 
ITS4 and GYM, 5. 8SF2/S3R produced bands for all 
14 samples (100% ), the PCR products for some 
species showed double bands on the agarose gel. 
Because of the low universality and the amplification 
of more than one band, these three primer pair were 
excluded from further assessments. The remaining 
four ITS2 primer combinations (GYM , 5. 8SF1/ 
ITS4, GYM _5. 8SFI/S3R, GYM. 5. 8S5F2/ITS4, 
and S2F/S3R) showed 10096 PCR success ( Table 
1), though the primer combinations GYM_5. 8SF1/ 
ITS4 and GYM, 5. 8SFI/S3R produced a weak PCR 
band for Ginkgo biloba (Table 1). The primer com- 
bination GYM, 5. 8SF2/ITS4 yielded a strong PCR 
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band and S2F/S3R a medium bright PCR band for single strong PCR band per reaction except for Cun- 
Ginkgo biloba ( GA) , respectively ( Figure 1). The ninghamia lanceolata ( G22) for which consistently 


remaining sampled species showed a single strong medium bright PCR band were obtained (Figure 1, Ta- 
PCR band on the family-level assessment for the four ble 1). Thus, the overall universality assessment 
primer combinations (Figure 1, Table 1). across all 56 species for the four candidate primer com- 


The universality of the four ITS2 candidate primer binations was 100% PCR success. Considering the 
combinations performed uniformly well on the remai- band intensities, primer combination GYM 5. 8SF2/ 
ning 42 sampled species with 100% PCR success and a ITSM performed best, followed by the pair S2F/S3R. 


GYM 5. 8SF1/S3R 
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Fig.1 PCR amplification profiles of ITS2 with primer combinations GYM 5.8SF1/S3R, GYM_S. 8SF2/ITS4, and S2F/S3R, on all 56 
species sampled for this study. Sample code as in Table 1. C means negative control for PCR, M means DNA ladder (100 bp) 
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Sequencing success and sequence quality 

The assessment of the sequencing universality 
and the quality of the generated sequence traces for 
the two best PCR performing ITS2 primer combina- 
tions, GYM_5. 8SF2/ITS4 and S2F/S3R, demon- 
strated 100% sequencing success with the seven 
species tested. All the 28 generated sequence traces 
of the seven tested species were of high base call 
quality, with few low-quality bases needing trimming 
out. The complete amplicon length for the two primer 
combinations ranged from 380 bp ( GYM _5. 8SF2/ 
ITS4) to 510 bp (S2F/S3R), including primer sites, 
for Cycas guizhouensis. When using the default set- 
tings for sequence trimming in Sequencher 91. 2 - 
99.4% and 95. 5-99. 596 of the bases (QV>30) 
remained with primer combination GYM_5. 8SF2/ 
ITS4 and S2F/S3R, respectively. Thus, the se- 
quence quality was very high and similar for the two 
primer combinations. The assembled contigs showed 
>70% overlap in the alignment of the forward and 
reverse reads without low-quality base calls for any 


of the generated ITS2 sequences. 


Discussion 

The nuclear ribosomal DNA (nrDNA) internal 
transcribed spacer (ITS) region is one of the most 
used DNA region for molecular systematics and evo- 
lution ( Álvarez and Wendel, 2003; Kress et al., 
2005; Nieto et al., 2008). It can provide signifi- 
cant sequence variability at the species level or lower 
(Yao et al., 2010; Liu et al., 2011). Because of 
the greater discriminatory power of nrITS over plastid 
regions at low taxonomic levels among plant species, 
it has been proposed as a DNA barcode in several 
studies ( Roy et al., 2010; Li et al., 2011b; Muell- 
ner et al. , 2011; Simon et al., 2012). The remark- 
able length variation of ITS in gymnosperms ( Liston 
et al., 1996, 1999; Gernandt and Liston 1999) is 
an obvious obstacle for a DNA barcode ( Liu et al. , 
2011). The "universal" ITS primer pair for gymno- 
sperms has revealed low level of PCR and sequen- 


cing success (Li et al., 2011b). It limited the use 


of ITS as a barcode in gymnosperms even though it 
was proposed as a core barcode for seed plants ( Li et 
al., 2011b). However, ITS2, the second spacer of 
the ITS region, has been suggested as an effective 
DNA barcode for plants (Chen et al., 2010; Pang 
et al., 2010; Yao et al., 2010; García-Robledo et 
al., 2013). If ITS in its entirety is difficult to am- 
plify and directly sequence, ITS2 could be an alter- 
native for gymnosperms because it is shorter and eas- 
ier to sequence than the entire ITS region ( Coleman 
et al., 2003). Therefore, ITS2 can represent a use- 
ful back-up barcode of ITS (Li et al., 2011b). 

The ITS2 primer pair used for gymnosperms 
showed relative low PCR and/or sequencing success 
in previous study (Chen et al., 2010; Pang et al., 
2012). It is therefore the lack of a highly universal 
primer pair for ITS2 that prevents its wider use in 
gymnosperms. Thus, primer universality is one of 
the first to fulfilling criteria when screening for a 
DNA barcode ( Chase et al., 2007; Kress and 
Erickson , 2007; Ford et al. , 2009; Hollingsworth et 
al., 2009). In this study, we designed three new 
primers based on sequences of the 5. 8S gene greatly 
conserved across 55 genera of gymnosperms. When 
the polymorphic site exists within the primers re- 
gion, we used the combined base in the prime ( Ta- 
ble 2). Four of the seven candidate ITS2 primer 
combinations here performed very well with 10096 
PCR success for all the tested 56 species covering all 
the families of the eight orders of gymnosperms. On- 
ly Ginkgo biloba and Cunninghamia lanceolata. yiel- 
ded weak or medium strong PCR bands for the four 
primer combinations. Ginkgo biloba is the only living 
representative of the family Ginkgoaceae with an iso- 
lated systematic position in the gymnosperms. This 
distant genetic relationship of Ginkgo biloba with 
other groups of gymnosperms may have resulted in 
changes in the annealing sites for most combinations 
(except for GYM_5. 8SF2/ITS4) and may explain 
the weak or medium PCR band of the candidate ITS2 
primers for this species (c. f. Möller, 2000). The 


case for Cunninghamia lanceolata ( Cupressaceae ) 
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is likely different, because all other sampled species 
of Cupressaceae yielded strong PCR bands. A rela- 
tively poor quality of template DNA used here or the 
presence of inhibitors might be responsible for the 
consistently weak PCR bands of Cunninghamia lan- 
ceolata. 

Based on the PCR success and yield of the PCR 
products of the four ITS2 primer combinations fully 
tested, two combinations GYM | 5. 8SF2/ITS4 and 
S2F/S3R_ performed slightly better. Of these two 
GYM_5. 8SF2/ITS4 performed slightly better than 
the benchmark primer pair S2F/S3R, and yielded 
strong PCR band for most sampled species here. 
Both primer combinations, GYM, 5. 8SF2/ITS4 and 
S2F/S3R, performed equally well in their universal- 
ity of sequencing. The generated sequences showed 
high levels of sequence quality and coverage as well, 
which is also an important criterion for DNA barcode 
screening. Based on our overall assessments of prim- 
er universality and sequence quality and coverage, 
we propose GYM, 5. 8SF2/ITS4 as the most “ univer- 
sal" primer pair for ITS2 for barcoding in gymno- 


sperms. 
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